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Xr; yrux (-<-k:a>?, v/e nave clcvotcfl iiracli tima and effort prpgtamming,- ,;,^v 
oorTtp,4.ri^;on o:avaplea of tl>^i ''^DP-ll and other raachinesr , The . ^m:.'|'*' 
.■X'UVtXt:G V.ave I'jrjcn qui.tG cicfaif.icr..ut - tha PDP-ll turns,, out- to -fv, 
ohs uignir.iatmtlY '-xhRad of nil othor small -eomEiuters .In- l>oth;_ ,;i:?[;V 
y,-,.je',I tino moAiury economy, -, ,, .'''■'•'* '' ' ;: 

' \ ' ' ' .' ' ^j 
yC'.iVj of the most startling statistics ares '; . v^^^,:^ 

™-Uxo p-DP-llh rir";3 prcgra"ao 5 timos faster than; ' ^n^ tjgtl^i-i 

. ;snm 11 computer' t ■ .^l-fy-i"* 

: ■ • .'. -".,"ro<)rar.v.i cun hn -coaod oi i:ho PDP-li using 33%'f.mmx:- :'^^:'f^ 

\u\:n than :ia equivcri.,.in!: >5irogram on any other ^computer;:;',V 

. • *#«-Are nattjraJ-l^Jt-iialte axcihed rtbout these statistics. ..'.'tl^. \ ••')':. 
"' .;-i^'imi^%on^.€h^'meci'~hrxcX w«3 rj'at ':rora tha groups producirif,' the.. '/■;_:■; 






'' '■••■t^Jftt^re iiidi'cater. that the miichinc it; very easy to' prograttfC;^ 
'"It'- slmoat seems too good to ho true, but I hav#- checked .thrts^^^ 
-^ha t-schnical material myself and find that.lt is aceuir^ie* ***-* 



Th'^. H::ri; /.tiled programi-ning co.apjirirson o:;<amples apt:>ear in -ft^im 
Ma/.; T, ■.-«.■: A PH 16-3 und PM 3r>--i 
hero , 



Mi 



!r>-l, I shall suimmarize t>|jf-.i?®#iiife.#<f;.'''.^ 






.'i'j \i..ii;Vive tcDt proh'i^i-rnj . Two arc principally characSter-" , , 
ori'.-nt^-'d mxd. -cue c:.rlt]\)i...jtic. Two of the routines will re%uii;.&'i 
to ';•■; "s'ahro«tiniacd*% in that they picked up arguraen(ts.;;-.^rom -c^ 
rtrac -In.a or reatorod valueiij* r-'i>s p.roblems were: '""'"■ ■=•""■•—■' 

1. Move character r; in edit 

2, Multiply sub coat in 1-^ 
?. . 'f'ol© ranee chedc 

.-■> „ Hic±ograin coatpallr-'rc.ioa 

'i „ ■ D?»c.lm-il to 'bibary cor. v or a ion 

#DP-lia- (l;ast 'e-iDP^ll) ^ 
■I'DP-llB (ol'QW PDP-liy'\ 

??jp~a , ■ 

'Data. General KOVA 
^''ariar* 5201 
Varian 6201 
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Hav7l3tt-Paokard 2114.\ 

Intox-flata 13 (internt;! instruction set) 

Interdata 13 (external instruction set) 

PDP-X (a machine v/e never built) 

SPC-12 

PDC~Q08 

The fo].lowing tabic s'aovrc the numhor of bite need to enpode 
each of the five problems on the 12 instruction sots: 

1 2 3 4 5 . 



PDP-ll^ 


200 


280 


240 


320 


200 


PDP-llB 


2B0 


280 


240 


3 20 


200 


PDP-8 


468 


432 


420 


376 


256 


NOVA 


640 


272 


3 20 


400 


368 


B201 


752 


488 


632 


496 


320 


620X 


1130 


49S 


780 


896 


440 


2114A 


• 1040 


304 


576 


656 


340 


13 (mi!) 

X3 (RXT) 


864 


1100 


1200 


784 


512 


784 


640 


688 


640 


384 


POP-X 


576 


368 


43 2 


448 


304 


.SPC-12 ■ 


824 


1030 


1420 


752 


344 


PDC-000 


664 


1080 


1320 


728 


632 



orma lining this to 100 for th© PDP-11, v/e gets 

1 2 3 4 



avG 



pDp-im 


100 


100 


100 


100 


100 


100 


PDP*11B 


100 


100 


100 


100 


100 


100 


POP.-S- 


167 


160 


175 


118 


128 


150 


NOV/I. 


229 


97 


133 


125 


184 


154 


520X 


269 


171 


264 


155 


160 


203 


620X 


404 


177 


326 


280 


220 


281 


2114A 


372 


137 


240. 


206 


170 


225 


13 (INT) 


309 


393 


500' 


245 ■ 


256 


340 


13 (E)CT) 


280 


229 


237 


200 


192 


237 


POP-X 


206 


• 131 


ISO 


140 


152 


161 


SPC-12 


295 


358 


592 


235 


172 


332 


PDC-80S 


237 


386 


550 


228 


316 


343 



The average over the five problems 
the right hand col'uian. It is ver^ 
clifferencGs bGtv;een machinGs. Sc 
12, PDC-808) actually use raor 



fcJPC 



s for each machine appears in 
y sic;3iif iccint to notice the 
Somo niC!.chinos (Interdata Internal, 
than three timos as many bits 
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to oncoflo a problem than the bast rr^achine in this category. 

Tho nurabor of macliine cycles to execute, each problera was also 
tallied. These reoults v/ere i 

1 2 3 4-5 



POP-'llA 


4260 


149 


1210 


12400 


49 


PDP-llR 


8210 


297 


2220 


20SOO 


98 


PDP-S 


50500 ^: 


313 


2640 


21600 


198 


MOVA 


31200 


135 


1530 


15500 


190 


iVAOl 


3fJ200 


260 


4880 


40000 


218 


C<20X 


43100 


451 


2250 • 


51G00 


190 


2ii4;i 


17400 


. 149 


1750 


33800 


133 


:f,3 (XHT) 


10200 


294 


4030 


3 2700 


125 


If 3 (T'iX'C) 


21000 


500 


2560 


40 500 


102 


fDP'-X 


11400 


isa 


13 20 


16500 


85 


SPC~3 2 


32200 


843 


10500 


57400 


195 


PDG-noo 


27000 


979 


12000 


63 700 


529 



A-gEtln normalizing to a 100 for the ?DP-lli\, wo get: 



4 



avc 



eye 



IPC 



PDP-13A 


100 


100 


100 


100 


100 


100 


1„0 


100 


PDP-llB 


193 


199 


3,82 


166 


200 


188 


3.0 


'564 


FOP-8 


1190 


"210 


218 


174 


404 


439 


1.5 


658 


wm 


734 


91 


127 


134 


388 


294 


2.6 


764 


5201 . 


1210 


175 


404 


3 22 


446 


511 


1.5 


757 


620X 


1010 


303 


187 


418 


338 


461 


1.0 


461 


21l4?i 


409 


100 


145 


272 


272 


239 


2.0 


478 


13 (I?TO) 


428 


195 


337 


264 


256 


295 


3.0 


887 


13 <S;tT) 


'494 


336 


212 


3 26 


208 


315 


1.8 


567 


PDP'->: 


268 


125 


109 


133 


173 


161 


1.0 


161' 


S.PC-'i2 


760 


567 


859 


462 


400 


611 


2.4 


1460 


PDG-808 


635 


656 


992 


514 


1080 


775 


8,0 


6190 



Ite number of cycles for each machxno viclk averaged in the column 
headed "eive". The next colum.. ^^.^v,) contains the cycle tiiaa for 
the machine. Finally, the right hand colvL';n gives the norraalircoci 
averaged speed for the five probleias on each raachine . Again there 
la a treracndous spread. The fact that tha :?DP-11 vrins so strongly 
in both spesed and memory economy ^ couple v/ifh its low price, clearly 
indicates that the prodxict has unlimitod potential. 



We have compiled ratings by teilcing the J.nverce of the bit and s"je.2d 
noruiali^ed averages for each machine. Trie plots for both speed" and 
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mQu\<my oconomy are < 



lr,ij -■.«. p'bP-iiA 



%o ■ 









1^V:f 






Cs 



Mo^iA^ l.''.:!oi. 






lo.i 






{.:•:■> 



\ii\-- f., 



D.<:o;i 






i-SAT/i (&'^n,£"Xl\^ V'bC-JioijK^ SfC-sa 



Z.O 



,3 _1. 



0.nls,grai:yi'acally indicates hov^ the PDP-lia simply' ruas . aA^av frora ■ ■- 
alltho other machinGs- in :proca3sinr/ Bo-ed and'hov/ the entire -v • 
^iJl>iX (:amlly hHViaily 6&f:or,tv. r,ll c!:V^-: con^pr^hcrs in bit efficlevcv,: 
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Afj a y,iru\l raUiug, v/o Civoi\:j(-"'. t^o cpcca v/iv.h the bit efficiency,- 
counting each eqvigilly^ The- re faults spo«k for thomselvass -*' 



/ft'^ ■;---;';..:^-:','. 
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■ — tjo^vr, 
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I. Introduction 

When the PDP-11 instruction set design was completed, a.numbeE of 
people asked how. it compared with other computers. To get the 
answer, it v/as decided to compare typical small computer problems 
on the PDP-11, PDP-8 and the NOVA. Five examples were chosen - 
■ ".some character oriented and ^some arithmetic . Each problem was 
coded for the three computers. Instruction bits and execution 
cycles were counted. The results of these counts were used to 
evaluate the performance of the various instruction sets. 

The PDP-11 turned out to be best both in speed and memory economy. 
Specif icallyj . ' . 

— the PDP-8 used 50% more instruction bits than the PDP-11 

-— i=.-the NOVA used 54% more instruction bits that the PDP-11 

* — the PDP-llA runs the example problems 6 times faster, than 

the PDP-8 , 

—the PDP-llA runs the '"'problems 7 times faster than the NOVA 

— the PDP-llB is 20% faster than the PDP-8 

—the PDP-8 is 15% faster than the NOVA 

V^ The study procedure which we follovred is defined, in Section II. 
\ The results of the study are given in Section III, Appendix A 
V contains the" actual code which we generated for the test problems. 



\ 



II . Data Gathering 

1/ ■ ■ " • 

This secti'ion discusses the ij:ist apiy.ications in the study and 

describes the evaluation pi'-cedurf^V 

1 . . 

A. /Applications _ ' . , 

We used five test problems. Two were principally cl-jaracter oriented 
and two arithmetic. Another operated on both 8-bit and 16-bit data. 
Two of the routines were required to be "subroutinized" , in that they 
picked up arguments from a calling routine cind restored values. 
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i - ■ ' 

• "1 

1, Move -Characters and Edit 

This problem tested the ability of the machine to move characters • 
(S~bits) from one memory block to another. It also made use of array 
indices - which had ranges of less than 256 and also which had ranges ! 
greater than 256. The ability of the machine to branch on character 
matches was also important, ' ' 

The input characters were broken dovm into a variable nxaraber of lines, 
and terminated with a special end of record character. Each line was 
of variable length, terminated by a special end of line character. 
The output character arjray was the same as the input ,' except" that the 
lines were edited 'to a fixed length'. The end of line, end of record 
characters were removed, as well as all embedded blanks. The indivi- 
dual lines were blank filled on the right to make them all the same lengtV 
(the output, line length vras greater than the number of nonblank 
• characters in each input line)'. 

•An example of the operation of this routine is: ' 



K G r^ ' e: A T k) u n s . er a • >v ^ F '^ L „ /^ R <r E. 

C A t E C o ?^i r' U 1- E R ^ $ 





"a. 



\ ^A C^ /Q E A t t^ U Mg E {<r 
V ^-^-~ ■— - - ^ . 




* J" ' 



' ->:? -, 
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The flow chart is: 



\ 




. 


MOV" Ci.rh^r 




A.'Jt-' rc>rr 


Eo\ 7 





where A is the input array 

B is the output character array 

I is an index to the input . . ' 

J is an index to. the output 

K counts the number of characters in an output line 
■ X holds the current input character 
N ia the ^variable) number of characters in an output line 
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Notice that the hoxes in the flov/ chart are nunihered. The nxmber, of 
times each box is executed Is given hy the assoicated nurnher in ^ 
parenthesis. .', ! 

We assumed that there were 10 input lines of 30 characters each, 20 : 
of which were nonblank. The output line length was 100. Bojj; 1 is j 
executed once for initialization. Boxes 2, 10 and 11 are entered 10 I 
times - once for each line. Boxes 3, 4, 5 and 7 are executed for each 
input character - in our example 300 or 30 per line. Box 6 is executed 
for only the nonblank characters - thus 200 times or 20 times per line. 
Boxes 8 and 9 are used to fill the input lines with blanks. This 
occurs 70 times per line for a total of 700 in the. problem, 

2. Multiply Subroutine . . ■ . ' 

This problem* had a number of objectives - testing the ability of the 
'machine to set up subroutine linkage, to sense bit configurations and 
branch conditionally and to easily shift double words. The test is 
also important because the multiply operation is commonly used. 

The program first picked up two 16-bit operands from the calling program. 
These were multiplied together by the usual shifting and adding method. 
The 32-bit result was returned to the calling program. The routine 
dperated only on unsigned integers. 
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The flow chart is: 



1 

'( 

( 



Liifr orx, l-cx \ 




/ 



SHIFT X 

T<- r -i- s 



t 




where :|C is a double word to hold one of the operands and the result 

i^ is one of the operands 

B is the other operand 

I is an index to the number of bits in each operand . 

N is the number of bits in each oparand 
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We assumed 16-bit operands in all oases. On the PDP-8, the program 
was written for 12~bit words, but the cycle count was later adjusted • 
as if the main loop was executed 16 times* X-Je assumed that the first" 
input argument contains exactly 8 one's. 

Box 1 is initialization and hence executed once. Boxes 2, 3 and 5 are 
entered for each bit - 16 times. Box 4 is executed only for 1 bits in 
the" first operand - 8 times according to pur assumptions. 

3. Tolerance Check 



I 



.. I 



The objective of this problem is to test arithmetic comparison capa- 
bilities and the ease in which the machine can index through an array 
of 16-bit guan»tities. Subroutine linkage V7as also considered in that the 
calling sequence to this problem was more complicated than that to the 
multiply subroutine. 

•The program picked up an aiTray address, the count of the number of elements 
in the array and two tolerance limits from the calling program. It indexed 
through the array, checking each element against the low and high limits. 
If all elements were within tolerance, the program returned an output 
value of zero to the caller. If any were out of tolerance, the index 
in the array of the offender was returned. 
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The flow chart isi 



\ 



__L. 



r<-/ 







Box t' (A 






l^STUrl!^ 



SOK 7 '"A 






-> /?£ri/.'^fi 






.? r.-^. 







~> .P,ETUfi..iJ 



r<- r -if 



Tol /r /v.' /Ia' c £ CHE CK 



where -I is an index to the nuraBex- of eleruants in the array 
N is 'the number of eleraents in the array 
A is the array of numbers 
X is the low limit 
Y is the high limit 
'K is the return argument 
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For timing considerations, we .assumed that there were lOO* entries in the 
array and that they were all in tolerance. Boxes 1 and 6 are executed ■ 
once, as they involve initialization and termination, respectively. Boxes 
2,3,4 and 5 are executed 100 times - once per array entry. Boxes 7 and 
8 are not executed because of our assumption that all entries are in j 

tolerance. i 

". i 

4. Histogram Compilation 

It tests the ability of the machine to randomly index to memory 
arrays and to increment 16-bit memory integers. 

The input is an array of 1,000 16-bit numbers, with values normally in the 
range of 1 thiysugh 100. The program must contain code to ignore other 
v&lues - 0*or 101 through 256. The output is a memory array of 100 16-bit 
numbers. These contain the counts of occurences of the 100 possible input 
values. For example, if the 16 -bit number 20 occurs exactly 15 times in 
•the input array, the contents of the 2 0th element of the output array 
must be 15. 



Comparison of PDP-11, PDP-8' and the ITOVA 
The flow -chart for this problem is: 



.^(^•.-.U s- 



11 January . 1569/.3 



/ 







where A Ig tho input data array 

B is the output histogram array 

I is an index through tho output array 

J is an index through the input array 
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Boxes 1 and 4 are for initialisation and hence, executed once. Boxes 
2 and 3 are used to zero the output arrciy and are executed once per 
output entry of 100 times. Boxes 5 through 10 are entered once per 
input entry, 1,000 times total. ' . . ■ 

5. Decimal to Binary Conversion ^* ^ 

The objective of this test was to determine how the machines performed 
in this rather common application. The problem also tests character 
manipulation and the ability to do specialized multiplication using 
shifts and adds. ' 

The input is a five character array and the output is an unsigned 
integer less than 32758. On the PDP-8, the routine was written for a 
12-bit operation, but the cycle counts vrere multiplied by 15. 

» 
The flow chart 1st 



\ 



/ 



/ 



; STO 










l>tCir..f{^ TO 



K<=r loK-i-J' I 



r .- 1 






A 



where A is the input character array 

I is an index to the input characters 

J is a temporary storage location for each binary digit 

K is the binary result 
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B . Procedure 

After the problems had been coded j v.-o. counted the number of bits 
and cycles' used. The bit count iz-iVol\v;d a tally of the instruction 
v7ords and storage for conGtant' data . Toroporary storage v/as not 
tallied^ as it could be chared ruTioiTg routines or was' containe'd in 
general registers. The word count: was simply multiplied by the 
vrord length t.o get the program bit count , 

Each prograra wara partitioned as specified by the boxes in the flow 
charts. The number of raachine cycles corresponding to each box 
v/as counted. This number v/as then multiplied by the corresponding 
box frequency, vrhich appei^rs with the box number on the flow charts. 
These products were then added together for each prograra, giving 
the cycle t^tal for the problem. 

Time figures for each computer vrc-;re computed by multiplying, the 
average cycle counts by the cydle time. This is assumed to be 
1.5 microseconds for the PDP-8, 3.0 for the PDP-llB, 1.0 for the 
PDP-llA and 2,6 for the KOVA, The times for the latter were computed 
by assuming that every instruction took 2 cycles except for JKP 
and JSR which took 1. 



Ill, Results 

' The following tabley sumnnarises the number of bits used for each 
problem.* ^ 



\ 



prob 

1 
2 
3 
4 

5 



PDP-11.^ 

280 
280 
240 
320 
200 



PDP-JJ.B 

280 
240 
3 20 
200 



PDP~8 

458 
432 
420 
376 
256 



MOVA 

640 
272 
320 
400 
368 



t 
I<iormaliaing this to 100 for the PDF-lly this becomes: 



prob 

1 
2 

3 
4 
5 



PDP-11?\ 

100 
100 
100 
100 
100 



100 
100 
100 
100 
100 



PDP-8 


NOVA 


167 


229 


160 


97 


175 


133 


118 


125 


128. 


184 



iverage 



100 



100 



150 



154 



V 
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The PD?~8 and the NOVA both have th:; liability of no btrte handling ' 
instructions. Subroutines to load and store bytes V7ere coded and • 
called V7hcn relevant, but the bit count for thetsa is not included • 
(aBsuming that the routines are shared over a large nxomber of 
programs) , ' • ^ ■ 

The nurnbor of bits used by the PD?~Q varied frora 18% more than the 
PDP-11 for the 'histogram e^ca.Tiple to 75^-^ more in the tolerance check, 
The NOVA porforiried fairly well in this area for 16 bit arithmetic, 
but very poorly for character manipulation ej - even though a sub- 
routine call v.-as used. 



B. SpcGd .'■-■', ' '■'•■-.' 

Appendix B contains a . detailed listing of the cycle counts for 
each problem on the three computers. The table for each problem 
has one rc«-7 for each box on the flov/ chare. The nximber of times 
that box is executed is given in the frequency column. Then for 
each computer, the nxamber of eyelets to a::ecuc3 the box and the 
cycle total is given. The cycle totals are added together to form 
a grand total of memory cycles for each computer. A summ.ary of 
the se i s : 



prob 


PDP-llA 


PDP~11B 


PDP-8 


NOVA 


1 


■ 4260 
A49 


8210 


50500 


31200 


2 


297 


313 


135 


3 


1210 


2220 


2640 


1530 


4 ; 


12400 


20600 


21600 


16600 


5 


49 


98 


198 


190 



Normalized to the PDP-IL^, this becomes! 



\ 



prob 
■' 5 



average 
time 



PDP-llA 

" iX)0 
100 
100 
100 
100 

100 

100 



PDP- 


-IIB 


' 


• 




■^ -.3 




19<J 




182 




166 




200 




les . 




554 



PDP-8 

11^0 
210 
218 
174 
248 

408 

610 



NOVA 

734 
91 
127 
134 
258 

269 

700 



comparison- of PDP-11, PDP-8 and th3 NO'/A .^^ jtauIrfweVJO 

' • '■ ■ ■•''!.. 

Thfc- time rov? above was computed hy nvaltiplying tlie av^age cycles •■ 
by the memory cycle time. As was r.Gntioned before, the bJOVA , , 

tSmes V7ere simplified by assuraing all inctructions tooK ^ n-iemory , 
cycles except JMP and J,S.R vmicli tooVc 1. The NOVA cycle is ,; 

assumr-a to be 2.6 micr o second s. Tho superr-ority of the PDP-llA is 
quite~ startling and it is cignificant that the PDP-llB eages out , 

both the PE)?-8.and the NOVA. ■ '; 
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